CDS

Accession Number TCMCG036C06670
gbkey CDS
Protein Id PTQ44220.1
Location join(878374..878499,878874..879003,879557..879663,879990..880124,880240..880343,881426..881526,881812..881948)
GeneID Phytozome:Mapoly0021s0078
Organism Marchantia polymorpha
locus_tag MARPO_0021s0078

Protein

Length 279aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772693.1
Definition hypothetical protein MARPO_0021s0078 [Marchantia polymorpha]
Locus_tag MARPO_0021s0078

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00340        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko03051        [VIEW IN KEGG]
KEGG_ko ko:K02737        [VIEW IN KEGG]
EC 3.4.25.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko03050        [VIEW IN KEGG]
map03050        [VIEW IN KEGG]
GOs GO:0000502        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005839        [VIEW IN EMBL-EBI]
GO:0019774        [VIEW IN EMBL-EBI]
GO:0032991        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:1902494        [VIEW IN EMBL-EBI]
GO:1905368        [VIEW IN EMBL-EBI]
GO:1905369        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAGACTCGATTTTAGCGGATTGGATCCCGTCACTTTGCACCGTCCCAAAGCCGATGCCGGTTTCGATTTGCCATGCCAGCCAATCGGCTCTGCTCCTTCGTTCGACTTGCCCGCCGTCGCCGATTTGGATGGGTTCGAGAAAGCAGCTGTGGACATGGTGAAGCCTCTTCATGGAACTACCACCTTGGCCTTCGTTTTTAAGGAAGGCGTTATTGTTGCAGCTGATTCACGAGCAAGCATGGGAAATTACATTTCTTCTCAGAACGTTAAGAAAATTTTGGAGATAAACCCTTATCTTCTAGGAACTATGGCAGGAGGTGCTGCAGATTGTCAATTTTGGCAAAGAAATCTTGGCACACGGTGCCGCCTACATGAACTTGGAAACAAGCGAAGAATTTCCGTGACGGGCGCATCCAAATTACTTGCTAATACTTTATACTCATACCGAGGAATGGGTCTATCCATGGGTACTATGATCGCTGGCTGGGATGAGACTGGCCCAGGCCTTTACTATGTGGACAGTGAAGGAGGTAGAGTCAAGGGAAGGCGGTTTTCCGTTGGATCAGGATCAACGTACGCTTATGGTGTTTTGGATACAGGATTTCACTGGGATATGACCATTGATGAAGCAGTGGAGCTTGCTCGACGCTCTATTTACCATGCGACGTTTCGTGATGGAGCTAGTGGTGGTGTAGCGAGTGTGTACTATGTTGGACCAAATGGTTGGAAGAAGATGTCCGGTGACGACGTAGGGGAGCTTCATTATAAGTACTACCCCGTGTCCGACTCGCCTGCAGAGAAAGAAATTGTAATGAAAGAGGCATCAAGCGCTTCATGA
Protein:  
MRLDFSGLDPVTLHRPKADAGFDLPCQPIGSAPSFDLPAVADLDGFEKAAVDMVKPLHGTTTLAFVFKEGVIVAADSRASMGNYISSQNVKKILEINPYLLGTMAGGAADCQFWQRNLGTRCRLHELGNKRRISVTGASKLLANTLYSYRGMGLSMGTMIAGWDETGPGLYYVDSEGGRVKGRRFSVGSGSTYAYGVLDTGFHWDMTIDEAVELARRSIYHATFRDGASGGVASVYYVGPNGWKKMSGDDVGELHYKYYPVSDSPAEKEIVMKEASSAS